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ABSTRACT 

Reported is a study of various aspects of test 
construction by science teachers. An attempt was made to examine: (1) 

variables affecting test-construction practice of pre-service science 
teachers; (2) attitudes of teachers as a result of instruction; and 
(3) relationship between test-construction behavior and attitudes, in 
addition, an attempt was made to compare testing practices of 
pre- service science teachers with another population of science 
teachers. The sample was drawn from secondary science majors enrolled 
in the science methods course and in student teaching. Two sample 
tests, obtained from pre-service science teachers, were analyzed and 
categorized according to the six levels of "Bloom's Taxonomy." The 
pre-. Post-, and delayed post-instructional data for determining 
subjects' attitudes toward the "taxonomy" were obtained by means of 
Word Association Scale (included in this report) . The data were 
analyzed by factorial analysis methods. The results showed that tests 
prepared during student teaching contained higher proportions of 
knowledge level items than the tests prepared during the methods 
course. (Author/PS) 
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Introduction 



During the past two decades the emphasis on teaching 



science as an immutable body of knowledge has been supplanted 
by the notion that it ought to be taught as a process of 
acquiring knowledge. The recent curriculum developments that 
emphasize teaching science as inquiry reflect these changes 
in philosophy. Concommitant with these developments has been 
a resurgence of interest in teacher behavior and the effect 
this behavior has on the learner. Testing practices are an 
example of teaching behaviors that warrant closer examination. 
Research has provided evidence demonstrating the Effect of 
questions on the learner. Increased learner achievement is 
one benefit which has been attributed to effective question- 
ing. Numerous educators have stated that the levels of 
thought developed and exhibited by the learner are a function 
of the kinds of questions he encountered during his education. 
Questions have also been recognized as powerful motivational 
devices, influencing how the learner studies and the values 
he places on intellectual activities . 



criticism of educators, questioning practices have not 

changed from those of 6 0 years ago. Teachers persist in 

#Paper presented at the National Association for Research in 
Science Teaching (NARST) Annual Meeting, Chicago, Illinois, April, 1972. 



In spite of the curricular changes and the continued 
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asking questions requiring primarily memory skills. This use 
of predominantly low level questions has a limiting effect on 
the success of the new science curricula because it obscures 
the nature of science and leads the learner toward the mistaken 
impression that science is nothing more than a conglomeration 
o f facts . 

A plausible explanation for the failure of teachers 
to ask higher level questions is that they simply lack the 
necessary knowledge and skills. It is evident that developing 
^h e teachers' ability to use higher level questions is a 
necessary step toward achieving the broader goals of science 
education. Science methods courses should assume the task of 
preparing science teachers to meet the demands placed on them 
by the new curricula and the nature of inquiry in science. 

A means of accomplishing this end is to prepare teachers to 
uS e Bloom's Taxonomy as a model for questioning. 

This study was designed to investigate the effectiveness 
of a method intended to develop improved test construction. 

The purpose of the study was to: 

, • examine the variables affecting test— construction 

practices of pre— service science teachers 

2. examine the effect of instruction on the attitudes 
of pre-service science teachers 

3. examine the relationship between tes t— cons t ruction 
behavior and attitudes 

k. determine whether the testing practices of 

pre-service science teachers were different from a 
population of science teachers. 
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The Judgment of the desired improvement in testing 
practices was based on the premise that: (l) questions effect 

the attitudes of the learner; (2) questions to a large 
extent determine the cognitive skills the learner will develop; 
and (3) the possession of the intellectual skills is central 
to the understanding of science. Therefore, instruction was 
sought to promote a decrease in the use of questions requiring 
only memory skills. 

The scope of the study was limited to secondary science 
majors attending Indiana University, Bloomington, Indiana. 

The study population consisted of secondary science majors who enrolled 
in the methods course during the period of the study. A subset of 
this population, those subsequently enrolling in student teaching, 
was selected for further study. 

PROCEDURES 

I l« » 

Test-Construction Data 

Sample tests representing test-construction behavior 
were obtained during the methods course and during student teaching. 

Sample Test 1 was prepared by all subjects as a required 
assignment for the methods course. The assignment, planning 
student evaluation, stated that the students were to prepare an 
objective-type test that: 

1. could be used with a specific population of 
students that was to be described in detail, 

2. contained a minimum of fifteen mul tiple- choice items 
and was at least twenty items in length, and 

3. covered what, in their judgment, would be an 
appropriate unit of instruction. 
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The students were allowed to choose the content and resource 
materials they wished to use to complete the assignment. The 
students were not aware that the assignment would be used as 
data in a study. 

One hundred and forty-nine tests were obtained during the 
methods course. 

Sample Test 2 was prepared by the subjects while they were 

student teaching. Since the dispersal of student teachers throughout 

the state made individual contact impractical, letters of request 

were dispensed in pre-stamped, return-addressed envelopes 

containing the delayed post-instructional attitude measure form. 

T, Hv*- Subjects to ere 

to prepare a test of a multiple-choice type for use in one of the 
classes he was teaching, and to return the test and completed 
form via mail. The area student teacher coordinators and 
supervising teachers were informed of the request and asked to 

encourage the student to complete the effort while he was student 
teaching . 

Eighty-two of the methods students enrolled in student 
teaching during the period of the study. Attrition caused by a 

fsetors reduced the return to 67 resnonses. Eighty — two 
percent return was judged to be an adequate sample. 

Sample tests 1 and 2 were analyzed and items categorized 
according to the six levels of the Tax o no my . 

Attitude Data 

Observations on the subjects' attitudes toward the 
Taxonomy were obtained by means of a Word Association Scale. 

Pre, post, and delayed post-instructional observations were obtained. 
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Three sets of attitude data were collected. Delayed post- 
instructional observations only were collected from one group of 
subjects. Post and delayed post-instructional observations were 
collected from a second group. Pre and post-instructional data 
were collected from a third group. 

Instrument Construction and Reliability 

Attitudes . Data representing the subjects' attitudes 
toward the Taxonomy were obtained by their responses on the Word 
Association Scale. The scale initially consisted of 67 pairs of 
bipolar adjectives taken from Measurement of Meaning (Osgood, Suci, 
and Tannenbaum, 1957) and the stimulus concept "Bloom's Taxonomy." 
The scale was constructed in the following manner: 

Concept: Bloom’s Taxonomy 

Polar term X 1^: 2: 3. : ii : JL : §.'• 1 Polar term Y 

1. Strongly associated with X 

2. Moderately associated with X 

3. Slightly associated with X 

U. Associated with neither X nor Y 
or 

equally associated with X or Y 

5. Slightly associated with Y 

6. Moderately associated with Y 

7. Strongly associated with Y 

The subjects were instructed to identify the association 
existing between Bloom's Taxonomy and each response pair by 
marking the scale between each nair. 

The scale was first administered to a group of methods 
students. The word pairs having no associative meaning (invoking 
primarily neutral responses) were deleted. The remaining pairs 
were submitted to a panel' of judges familiar with the Taxonomy. 
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The judges were instructed to indicate the pairs having the 
closest relationship to the Taxonomy . From the pairs judged to 
have the closest association, fifteen pairs were selected for the 
final form. The form was comprised of six evaluative pairs, three 
potency pairs, and six oriented-activity pairs. The selected 
pairs were randomized with respect to polarity and 
dimension to construct the final form. The positive end of the 

scale was assigned a value of seven and the negative end of the 

scale was assigned a value of one with five equal intervals between 

the one and seven. The subjects attitude subs cores were obtained 
by summing their responses in each of three dimensions. 

Tests . Two hundred and sixteen tests were collected for 
the study. The items on each were examined and classified according 
to the six categories in The Taxonomy of Educational Objectives , 
Handbook 1; Cognitive Domain (Bloom ed. , 1956) as knowledge, 
comprehension, application, analysis, synthesis, and evaluation. 

A random sample of ten tests was selected to check the 
investigator's reliability in using the Taxonomy . The intra-scorer 
reliability was checked using Scott's Formula for Scoring Reliability. 
Interscorer reliability was checked by establishing a panel of 
three judges, the investigator and two other Judges familiar with 
the Taxonomy , and comparing their independent ratings of the items 
on the ten tests using an analysis of variance estimate of 
Judge's ratings. 
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Analysis of Data and Results 

The data were analyzed to answer the following research 
questions : 

1. Do the students' grade point averages or the 
students' majors affect their test— construction 
behavior during the methods course and during 
student teaching? 

2. Does cognitive instruction in the Taxonomy cause 

a change in attitude that persists during the student 
teaching assignment? 

3. Do the subjects' attitudes toward the Taxonom y 
affect their test-construction behavior? 

U. Do the subjects' test-construction behavior differ 
from the test-construction behavior of science 
teachers cited in an earlier study? 

Question 1 : To answer this question, the subjects 

were ranked on the basis of GPA and classified as high or low using a 
median split, categorized according to major (biology, general 
science, or physical science), and test-construction data were 
collected. The proportions of items occurring at the six levels 
of the Taxonomy were calculated for both sample tests. One criterion 
variable, proportion of knowledge level items, was examined. 

A 2 x 2 x 3 factorial analysis of variance model with 
two levels of repeated measures on the criterion variable, two 
levels of GPA, and three levels of major was generated. 



